How fast do equilibrium payoff sets converge in repeated games?
نویسندگان
چکیده
We provide tight bounds on the rate of convergence of the equilibrium payo sets for repeated games under both perfect and imperfect public monitoring. The distance between the equilibrium payo set and its limit vanishes at rate (1− δ)1/2 under perfect monitoring, and at rate (1− δ)1/4 under imperfect monitoring. For strictly individually rational payo vectors, these rates improve to 0 (i.e., all strictly individually rational payo vectors are exactly achieved as equilibrium payo s for δ high enough) and (1− δ)1/2, respectively.
منابع مشابه
Reputation with Long Run Players
Previous work shows that reputation results may fail in repeated games with long-run players with equal discount factors. We restrict attention to stage games where two players, with equal discount factors, move sequentially. A pure Stackelberg action is assumed to exist. One and two sided reputation results are provided. If one of the players is a Stackelberg type with positive probability, th...
متن کاملReinforcement learning with restrictions on the action set
Consider a 2-player normal-form game repeated over time. We introduce an adaptive learning procedure, where the players only observe their own realized payoff at each stage. We assume that agents do not know their own payoff function, and have no information on the other player. Furthermore, we assume that they have restrictions on their own actions such that, at each stage, their choice is lim...
متن کاملA Learning Theory for the Harsanyi's Doctrine in Repeated Games
This paper investigates simultaneous learning about both nature and others’ actions in repeated games, and identifies a set of sufficient conditions assuring that equilibrium actions converge to a Nash equilibrium. Players have each an utility function over infinite histories continuous for the product topology. Nature’ drawing after any history can depend on any past actions, or can be indepen...
متن کاملThe Maximum Efficient Equilibrium Payoff in the Repeated Prisoners' Dilemma
We describe the maximum efficient subgame perfect equilibrium payoff for a player in the repeated Prisoners’ Dilemma, as a function of the discount factor. For discount factors above a critical level, every efficient, feasible, individually rational payoff profile can be sustained. For an open and dense subset of discount factors below the critical value, the maximum efficient payoff is not an ...
متن کاملExcess Payoff Dynamics, Potential Dynamics, and Stable Games
We consider a model of evolution in games in which agents occasionally receive opportunities to switch strategies, choosing between them using a probabilistic rule. Both the rate at which revision opportunities arrive and the probabilities with which each strategy is chosen are functions of current normalized payoffs. W e call the aggregate dynamics induced by this model excess payoff dynamics....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- J. Economic Theory
دوره 165 شماره
صفحات -
تاریخ انتشار 2016